arXiv: 1504.04449v3 [quant-ph] 9 Jan 2017 


Decoding quantum information via the Petz recovery map 

Salman Beigi* Nilanjana Datta^ Felix Leditzkyl 

May 23, 2016 


Abstract 

We obtain a lower bound on the maximum number of qubits, which can be trans¬ 

mitted over n uses of a quantum channel Af, for a given non-zero error threshold e. To obtain 
our result, we first derive a bound on the one-shot entanglement transmission capacity of the 
channel, and then compute its asymptotic expansion up to the second order. In our method 
to prove this achievability bound, the decoding map, used by the receiver on the output of the 
channel, is chosen to be the Petz recovery map (also known as the transpose channel). Our 
result, in particular, shows that this choice of the decoder can be used to establish the coherent 
information as an achievable rate for quantum information transmission. Applying our achiev¬ 
ability bound to the 50-50 erasure channel (which has zero quantum capacity), we find that 
there is a sharp error threshold above which scales as y/n. 


1 Introduction 

The capacity Q{Af) of a quantum channel AA, for the transmission of quantum information, is 
referred to as its quantum capacity, and is given by the regularized coherent information of the 
channel [28, 39, 11]: 

Q{M) = lim (1.1) 

n^oo 77 , 

where IdJ^) ■= ^siXp Id-^, p) is the maximum coherent information of the channel over all input 
states p (cf. Definition 2.8). The regularization in (1.1) is necessary in general. However, for the 
class of degradable quantum channels [12] the coherent information is additive, and hence, for such 
channels the quantum capacity is given by the single letter formula Q{J\r) = Id-hf)- 

The expression (1.1) for the quantum capacity is obtained in the so-called asymptotic, memo¬ 
ryless setting, that is, in the limit of infinitely many uses of the channel (which is assumed to be 
memoryless, i.e., it is assumed that there is no correlation in the noise acting on successive inputs to 
the channel), under the requirement that the error incurred in the protocol vanishes in the asymp¬ 
totic limit. It is, however, unrealistic to assume that a quantum channel is used infinitely many 
times. Instead, it is more meaningful to consider a finite number (say n) of uses of the channel and 
to study the trade-off between the error and the optimal rate of information transmission. 

Let Q"'’^(AA) be the maximum number of qubits that can be sent through n uses of a memoryless 
quantum channel M, such that the error incurred in the transmission is at most e (see Section 3.1 
for precise definitions). We are interested in the behavior of for large but finite n, as 
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a function of e. From the above discussion it is evident that = nQ{M) + o(n) for an 

appropriate sequence {en}n>i that vanishes in the asymptotic limit. However, our aim is to find a 
more refined expansion for 

1.1 Main result 

Our main result in this paper is proving a lower bound on of the form 

> nIciAf) + ^nVe{M) + O(logn). (1.2) 

Here 14 (AA) is an e-dependent characteristic of the channel Af which we call the e-quantum dis¬ 
persion (cf. Section 2.2); it takes one of two values for the ranges e € (0,1/2) and e G (1/2,1), 
respectively. Moreover, l>“^(e) denotes the inverse of the cumulative distribution function of the 
standard normal distribution (cf. Section 2.1). We refer to a bound of the form (1.2) as a second 
order achievability bound. 

The significance of our proof of (1.2) is that at the decoder we use the Petz recovery map [34], 
also known as the transpose channel. In previous proofs of achievable bounds on quantum capacity 
using the decoupling theorem [19, 13], the decoding map is only given implicitly, its existence being 
guaranteed by Uhlmann’s Theorem. In our proof however, the decoding map is introduced explicitly 
and depends solely on the code space. We will explain our method in some detail in Section 1.3. 

We also examine the bound (1.2) for the 50-50 (symmetric) quantum erasure channel. This is a 
quantum channel which, with equal probability, transmits the input state undistorted, or replaces 
it with an “erasure state”, the latter being a fixed pure state in the orthocomplement of the input 
Hilbert space of the channel. The capacity of this channel is known to be zero [4] by the No-cloning 
theorem, i.e., Q{Af) = 0 for the 50-50 quantum erasure channel Af. A stronger result that can be 
proved based on ideas from [29] is that for any error 0 < e < 1/2 we have Q'^’^{Af) = 0(1). (Note 
that the error criterion used in [29] is different from ours, so the error threshold there is l/\/2-) 
However, our bound (1.2) implies that Q^’^{Af) > Pl{y/n) for any e > 1/2. We note that this bound 
cannot be obtained from the previous one-shot bounds for quantum capacity [7, 9]. 

1.2 Related works 

Our lower bound (1.2) is reminiscent of the second order asymptotic expansion for the maximum 
number of bits of information which can be transmitted through n uses of a discrete, memoryless 
classical channel W, with an average probability of error of at most e denoted by Such 

an expansion was first derived by Strassen in 1962 [40] and refined by Hayashi [15] as well as 
Polyanskiy, Poor and Verdii [36]. It is given by 

C^’^W) = nC{W) + y/nVeiW) ^'^e) + O(logn), (1.3) 

where C(VV) denotes the capacity of the channel (given by Shannon’s formula [38]) and I4(1V) is 
an e-dependent characteristic of the channel called its e-dispersion [36]. 

In the last decade there has been a renewal of interest in the evaluation of second order asymp¬ 
totics for other classical information-theoretic tasks (see e.g. [16, 15, 26] and references therein) 
and, more recently, even in third-order asymptotics [24]. The study of second order asymptotics 
in Quantum Information Theory was initiated by Tomamichel and Hayashi [45] and Li [27]. The 
achievability parts of the second order asymptotics for the tasks studied in [45, 27] were later also 
obtained in [3] via the collision relative entropy. 

Our second order achievability bound (1.2) is similar in form to (1.3). Nevertheless, its opti¬ 
mality is open. Note that it follows from the strong converse property of the quantum capacity of 
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generalized dephasing channels [46] that, for such channels, /c(A^) is exactly equal to the first order 
asymptotic rate (and not just a lower bound on it) for any e S (0,1). Moreover, from the result 
of [29] it follows that, for degradable channels, the first order asymptotic rate is given by Ici-M) for 
£ G (0,1/2). Our bound (1-2) has recently been shown to be tight up to second order expansion 
for all e G (0,1) for the qubit dephasing channel [42]. 

As explained in Section 4, the bound (1.2) can also be proven using the one-shot achievability 
bound of [29, Proposition 20]. Nevertheless, the proof of this one-shot bound is via the decoupling 
theorem [19, 13]. As mentioned above, in contrast to our method, the decoupling theorem does not 
explicitly provide a decoder. 

The Petz recovery map (or transpose channel) was introduced by Petz [35, 34] (see also [33]). 
In [1] it was shown that, if the Petz recovery map is used as the decoding operation, then the 
average error incurred in sending an ensemble of commuting states through a quantum channel is 
at most twice the minimum error. Later, this map was also used to characterize so-called quantum 
Markov chain states [20]. Furthermore, it was used to study quantum error correcting codes in [31]. 

Our work should be considered as a new step towards understanding the usefulness of the Petz 
recovery map. In particular it would be interesting to see whether the ideas in our work can be 
used to show tight achievability bounds for other quantum protocols such as quantum state merging 
and quantum state redistribution. Another open question in this area is the optimality of the Petz 
recovery map in the Fawzi-Renner inequality [14] for approximate Markov chain states (see [22] 
and references therein for a discussion of this question). 

1.3 Overview of our method: Decoding via the Petz recovery map 

Following a standard procedure, our strategy for obtaining the above lower bound (1.2) is to first 
prove a so-called one-shot lower bound, i.e., a lower bound on in terms of the information 

spectrum relative entropy [45] (cf. Definition 2.7). Then can be estimated using this one- 

shot bound applied to the channel Since by definition, we arrive at 

(1.2) by applying the results of [45] for computing the second order asymptotic expansion of the 
information spectrum relative entropy (see Proposition A. 8). 

To prove the lower bound (1.2) on we first study a related information-processing task 

called entanglement transmission, which entails the transmission of entanglement from the sender 
to the receiver over multiple uses of the channel Af. It is known [2, 25, 11] that in the asymptotic, 
memoryless setting, as well as in the one-shot setting [25, 6], quantum capacity and entanglement 
transmission capacity are related. So we first prove the lower bound of equation (1.2) for the 
entanglement transmission capacity. This result is stated as Theorem 3.6 in Section 3.5. We then 
arrive at the desired inequality (1.2) as a corollary of this theorem (cf. Corollary 4.2 in Section 4). 

In our coding theorem, as usual, we choose the code subspace randomly, yet fixing the decoding 
map to be given by the Petz recovery map. To be more precise, let Na^b be a quantum channel. 
We choose some positive semidefinite matrix Sa at random, and define the code space as the support 
of S. We then fix the decoder to be 

» = r|ojVor;,)j,. (1,4) 

Here is the adjoint map of M (defined with respect to the Hilbert-Schmidt inner product), 

and the map F^, for an arbitrary X is defined by Fj\:(/3) = XpX^. We note that the CPTP map P 
maps any quantum state ub to a state inside the code space suppS^, and hence is a valid decoder. 
The main result of this paper is that with this decoder we can achieve the bound (1.2) on the 
quantum capacity. 
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The technical part of our proof consists of the following steps. We hrst observe that the average 
fidelity, when the decoder is chosen as in (1.4), can be written in terms of the so-called collision 
relative entropy (cf. Definition 2.6). We then obtain a lower bound on the expected value of the 
average fidelity over the random choice of the code space. This lower bound resembles the coherent 
information of the channel, and should be considered as our main technical contribution. 

Our method is a generalization of the one introduced in [3] in which the pretty good measurement 
(or square root measurement) was used as the decoding measurement for the (classical) capacity of 
classical-quantum channels. In [3] it was observed that the probability of correct decoding using 
the pretty good measurement can be written in terms of the collision relative entropy. Then the 
joint convexity of the exponential of collision relative entropy was used to obtain a lower bound on 
the expected probability of correct decoding over the random choice of the codebook. 

Here, for entanglement transmission, we follow similar ideas: We replace the pretty good mea¬ 
surement with the Petz recovery map, and the probability of correct decoding with the average 
fidelity. However, the joint convexity of the exponential of the collision relative entropy is not 
enough to obtain the desired lower bound. We overcome this difficulty by proving a weak mono¬ 
tonicity property of the collision relative entropy under dephasing (Lemma 3.5), which together 
with the joint convexity gives the final result. This lemma and its proof technique may be of 
independent interest. 

1.4 Organization of the paper 

In the following section we fix our notation, introduce the required definitions, and collect some 
basic tools that are needed to prove our main results. In particular, the dehnitions of the relevant 
entropic quantities are given in Section 2.2. Section 3 contains the main results of this paper in 
which we prove a one-shot achievability bound and a second order achievability bound for the 
protocol of entanglement transmission. In Section 4 we study similar bounds for the protocols 
of quantum information transmission and entanglement generation. In Section 5 we discuss the 
implications of our results for the special case of the 50-50 (symmetric) erasure channel. We defer 
some of the proofs of results in the main text to the appendices. 

2 Preliminaries 

2.1 Notation 

Let Bifii) denote the algebra of linear operators acting on a Hilbert space TL. In the following we 
only consider finite-dimensional Hilbert spaces. We denote the set of positive semidefinite operators 
by Vifi-L). Let := {p G Vifi-i) | trp < 1} be the set of sub-normalized quantum states, and 

Vifi-i) be the set of normalized states (density matrices). For a pure state \fi) G H, we use the 
abbreviation = |V')(V'I £ 'Dili.) for the corresponding density matrix. For A G VfihL), we write 
supp(4) for the support of A, i.e., the span of eigenvectors of A corresponding to positive eigenvalues. 
Moreover, we let H^i be the orthogonal projection onto supp(H). For Hermitian X,Y £ B{T-L) we let 
{X < T} denote the orthogonal projection on the span of eigenvectors of X — Y with non-negative 
eigenvalues. Hilbert spaces are often indexed by uppercase letters as in T-Lai and for simplicity of 
notation we denote T-La by TLab- Operators acting on T-La are distinguished by subscripts as 

in Xa G B{'Ha)- The identity operator in B{'Ha) is denoted by Ia, and we often omit it by using 
the shorthand XbYab = {IA <8> Xb)Yab- The completely mixed state is denoted hy t:a = Ia/ d 
where d = dfm'H.A- 
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A quantum channel is a linear, completely positive, trace preserving (CPTP) map A: B{%a) —^ 
B{'Hb)-, which we denote by A: A —)• S or Aa^b- Given a Stinespring isometry Ua '■ TiA Ti be oi 
a quantum channel Aa-^b such that A{pa) = tvE{UAPAU\), we define the complementary channel 
A'^: A ^ E as A^{pa) = The identity channel acting on B{'Ha) is denoted by id^- 

The algebra B{'H) is eqnipped with the Hilbert-Schmidt inner product; for X,Y G BiH) it is 
dehned as {X, Y) = tr(Afy) where X"^ is the adjoint of X. Then for a quantum channel A: A ^ B 
we may consider its adjoint map A* : B ^ A determined by {Xa, A*{Yb)) = {A{Xa),Yb). Let 
Ba = T-Lb he isomorphic Hilbert spaces with dim^yi = dimBB = d and fix bases {\iA)}i=i and 
{|*-B)}f=i ™ them. Then we define a maximally entangled state (MES) of Schmidt rank m to be 

^ m 

\^'ab) ■= ~i= 1*^) ® l*s) G diu ® T-Lm', (2-1) 

y/rn ^ 

where Bm ^ TdA, Tdu’ YBb, and Bm — are isomorphic subspaces with bases {\iA)}'^i and 
{|zb)}™i- If m = d, we abbreviate = \^ab)- define the Choi state ta'b = of a 

quantum channel Aa-^b by ta'b '■= (idyi' where Ba' — ^Ia- 

The inverse of the cumulative distribution function of a standard normal random variable is 
defined by := sup{z G ffi | <h(z) < e}, where ^{z) = Since is 

continuously differentiable, we have the following lemma: 

Lemma 2.1 ([10]). Let e > 0, then 



for some ^ with — e| < 

2.2 Distance measures and entropic quantities 

In this subsection we collect definitions of usefnl distances measures and entropic quantities that 
we employ in our proofs. We refer to Appendix A for a list of properties of these quantities. 

Definition 2.2 (Fidelities). Let p,a £T>(B). 

(i) The fidelity F{p, a) between p and cr is defined as 

-T(p, cr) := llv^v^lli = tr ^J^/pay/p. 

For pnre states €B, the fidelity rednces to F{’ip,ip) = |('0|(/9)|. We nse this definition of 
fidelity even if one of p or u is sub-normalized. We also nse the notation F^(p, a) = (F(p, cr)fi. 

(ii) [32] The average fidelity of a quantnm operation A acting on B{B) is defined by 

Favg(A; B) := [ dp{(j)) {(j)\ A(0) \(p ), 

JipeU 

where d/i((/>) is the uniform normalized (Haar) measure on unit vectors \4>) G B. 

Definition 2.3 (Relative entropy and quantum information variance). Let p G P('H) and a G V{B) 
be such that suppp C suppa. Then the quantum relative entropy is defined as 

D(p||fj) := tr[p(logp-log(T)], 
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and the quantum information variance is defined as 


V (/)||a) := tr [/3(log p - log fj)^] - D{p\\af. 


The von Neumann entropy of a state p G V{'H) is given by H{p) := — tr(plog p) = —D{p\\I), and 
we write H{A) = H{pa)- For a bipartite state pab S the conditional entropy H{A\B)p 

and mutual information I{A;B)p are defined as H{A\B)p := H{AB)p — H{B)p = —D{pab\\Ia®Pb) 
and I{A;B)p := H{A)p - H{A\B)p = D{pab\\pa ® Pb), respectively. 

The following entropic quantities play a key role in our proofs. 

Definition 2.4 (Max-relative entropy; [8, 37]). Let p G T){fH) and a G VifhL), then the max-relative 
entropy L>jnax(/3||o') is defined as 

L>max(/o||o') := inf |a G M I p < . 

For £ G (0,1), the smooth max-relative entropy D^^^{p\\a) is defined as 

F>max(plk) := max(p||'^)* 

pe^e(p) 

where Be{p) := {p G V<{'H) \ F'^{p,p) > 1 — e^} is a ball of sub-normalized states around the 
(normalized) state p. 

Definition 2.5 (Conditional min- and max-entropy; [37, 41]). Let pab £ 'F><{'Hab), then the 
conditional min-entropy H m\A A\B) p is defined as 

iLmin(^|^)p := - min D raax.{,PA b\\IA ® ^B^- 

(tb£-'D{Hb) 

For £ G (0, 1), the smooth conditional min-entropy is defined as 

•“ _ 77min(^|7?)p, 

PAB^^eiPAB) 

and the smooth conditional max-entropy is defined as 

Hf^^^{A\B)p := min maxlogF^(pAB, La ® (Ts). 

PAsSBe(pAs) 

The collision relative entropy is a central quantity in Section 3. Its conditional version was first 
defined by Renner [37] in the quantum case. 

Definition 2.6 (Collision relative entropy). For p G T>[TL) and a G VifH) the eollision relative 
entropy is defined as 

D 2 {p\W) ■= logtr . 

The second order asymptotic analysis in Section 3 relies on the information spectrum relative 
entropy, whose quantum version was first introduced in [45]: 

Definition 2.7 (Information spectrum relative entropy). Let p G a G V{'H), and e G (0, 1). 

The information spectrum relative entropy is defined as 

Dt{p\W) ■= sup {7 G M I tr (p{p < 2'i'cr}) < e}. 
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Finally, we define two important quantities in our discussion, the coherent information /c(AA) 
and the e-quantum dispersion Ve{N) of a quantum channel. 

Definition 2.8 (Coherent information of a quantum channel). Let Af: A ^ B he a quantum 
channel with Stinespring isometry hlj\f. Ba —^ Bbe- Furthermore, for pA G T>{T-L) let \^ba) ^ 

purification of pA and set 

\^RBe) = {Ir ®UAf) I'^ra) ■ ( 2 - 2 ) 

Then the coherent information of the quantum channel for the input state pA is defined as 

Ic{Af,PA) ■= Di^^RB\\lR<Si ujb) = -H{R\B)i^. 
whereas the coherent information of the quantum channel is defined as 

IciAf) ■=maxIciAf,PA)- (2.3) 

PA 

Define the set Sc{Af) of quantum states achieving the maximum in (2.3) as 


5c(AA) := argmax/c(AA,pA), (2.4) 

PA 

then we can introduce the e-quantum dispersion Vs{Af) of the quantum channel Af: 


VeiAf) := 


maXpA&SciAf) ^(wiJS ||Lh (8) cub) 


if e G (0,1/2) 
if eG (1/2,1), 


where for each pA the state corbe is defined as in (2.2). 


(2.5) 


Remark. Since any two purifications of pA are connected by a unitary acting on the purifying 
system R alone, the coherent information as well as the e-quantum dispersion are independent of 
the chosen purification. 


3 Entanglement transmission via the transpose channel method 

3.1 Protocol 

In this section we define the protocol of entanglement transmission and its associated capacity. 
Suppose Alice and Bob are allowed to communicate via a quantum channel Af'■ B{'Ha) ^ B{'Hb), 
where Ba denotes the Hilbert space of the system whose state Alice prepares as input to the 
channel, and Bb denotes the Hilbert space of the output system of the channel that Bob receives. 
We consider the following information-processing scenario [12] in the one-shot setting. 

Definition 3.1 (Entanglement transmission). Let e G (0,1) be a fixed constant. Alice’s task 
is to convey to Bob, over a single use of the channel Af, a quantum state on a system M in her 
possession, such that the entanglement between M and some reference system R that is inaccessible 
to her is preserved in the process. More precisely, we define a one-shot e-error entanglement 
transmission code as a triple {m,BM,TA), where Bm fL Ba is a subspace with m = dimBM and 
V- B{Bb) -a B{Ba) is a CPTP map such that 

F,^t{Af-,BM) ■■=F\^^j„{idR®VoAf){^^A)) > 1-^, (3.1) 
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where is a maximally entangled state of Schmidt rank m = dim'^M between the snbspace 
'Hm C T-La and a reference system R. Note that as defined above, depends on P, but 

we suppress it for notational simplicity. 

The £-error one-shot entanglement transmission capacity of the channel M is then defined as 
:= supjlogm : 3{m,RM,T^) such that Fent(AA;'HM) > 1 — e}- (3.2) 

We define 

«'(JV) ;= Qli‘ (V®"), (3,3) 

and call it the e-error n-blocklength entanglement transmission capacity of the channel M. 

For the n-blocklength capacity given by (3.3) above, the corresponding capacity in the 

asymptotic memoryless setting is defined as 

Qet(AA) := lim lim -^"^’^(A^). (3.4) 

e^O n—>-ao n 

By the Lloyd-Shor-Devetak theorem [28, 39, 11] it is given by the following expression: 

Qet(AA) = hm -4(AA®-). (3.5) 

n—^oo Tl 

Note that if J\f is degradable, then by the additivity of coherent information for degradable 
channels [12] the regularized expression in (3.5) reduces to the single letter expression 

QetiAf) = (3.6) 


3.2 One-shot bound 

The main result of this section is a lower bound on the one-shot e-error entanglement transmission 
capacity. We then use this lower bound in Section 3.5 to prove a lower bound on the n-blocklength 
capacities. Throughout this section, we abbreviate p = pA- 

Theorem 3.2 (One-shot bound). LetAf: A ^ B be a quantum channel and fixe G (0,1). Then for 
every density matrix p, the one-shot e-error entanglement transmission capacity, defined through 
(3.2), satisfies: 

> min|i:)fi(idR(g)Ar(T^)||/n<8)Ar(p)) + log^^^—(T^||/ij (g) p) -h log ^. 

L 1 — ei 1 — £2 J 

Here, is a purification of p, and for i = 1,2 the parameters Sj, hj > 0 are chosen such that 

e = (^ei + -\/tr(p2) -|- 62^ d) ’ 

with d = dim Ha cind 0 < dj < e,. 





Outline of the proof: As explained in the introduction, the proof of this theorem follows from 
similar ideas as those in the proof of the achievability bound of [3] for the classical capacity of 
quantum channels. Our proof consists of the following steps: 

(a) Using random encoding at the sender’s side 

(b) Fixing the decoding map to be the Petz recovery map (transpose channel) 

(c) Writing the average fidelity in terms of the collision relative entropy 

(d) Applying weak monotonicity under dephasing (Lemma 3.5) 

(e) Using the joint convexity property of the collision relative entropy (Lemma A.5) 

Step (a) is fairly standard and amounts to choosing a random subspace as the code space. We 
use ideas from [18] to determine the distribution according to which we pick this random subspace. 

Steps (b) and (c) are borrowed from [3], where the pretty good measurement was used to decode 
a classical message. There it was observed that the (average) probability of successful decoding 
can be written in terms of the collision relative entropy. Here for the entanglement transmission 
protocol, the pretty good measurement is replaced with the Petz recovery map. Note that in the 
case of classical-quantum channels the transpose channel reduces to the pretty good measurement; 
hence, we can regard the former as a generalization of the latter. It is easy to see that the average 
fidelity for entanglement transmission, obtained by the transpose channel method, can be written 
in terms of the collision relative entropy. 

In [3], in order to prove the achievability bound for the capacity of a classical-quantum channel, 
the final step was to use the joint convexity property of the collision relative entropy similar to 
step (e). However, this property by itself is not sufficient for obtaining the desired lower bound. 
To overcome this problem, in step (d) we prove a key result about a weak monotonicity property 
of the collision relative entropy under dephasing (Lemma 3.5). This lemma should be considered 
as the main new ingredient of our method. After proving it in Section 3.3 below, we then proceed 
with the proof of Theorem 3.2 in Section 3.4. 

3.3 Weak monotonicity under dephasing 

In the following, ^ is a Hilbert space of dimension d with the computational orthonormal basis 
{|1),..., |d)}. Later we will take % to be T-La, the input space of the channel. For a vector 
\ip) = ci\l) + ■ ■ ■ + Cd \d) we define 


:= Cl |1) H -I- Cd |d) , 

where c* is the complex conjugate of Cj. 

Definition 3.3 (Dephasing map). Let U he a unitary operator acting on T-L. Then the vectors 
\ui) := U\i) for i = 1,... ,d form an orthonormal basis for Ti. We define the dephasing map Tu 
associated with U by 

d 

Tuip) ■= ^ \ui){ui\p\ui){ui\. 

i=l 
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For any unitary operator U, define the unitary operator Zjj through the relation 

Zf/ \uj) = 1^.) , 

where {\ui)}^^i is the basis defined above with respect to U. Moreover, for every X S B{Ti.) let Fx 
be the superoperator defined by 


Tx{p)-.= XpX^. (3.8) 

The following characterizations of the dephasing map Tu follow from straightforward calcula¬ 
tions. In the sequel, we abbreviate ui = \ui){ui\ and u* = lnp(u*|. 

Lemma 3.4. 

(i) Let = 1 Yli=i K) ® N) ® maximally entangled state. Then for every unitary operator 

U we have 

1 ^ 

{Tu (8) idA)(4>) = - u*. 

i=l 

(ii) For every unitary operator U and p G V{'H), we have 

Tv(p) = \^ZlpZp = 

j=0 j=0 

Now we are ready to state and prove the lemma about weak monotonicity under dephasing. 

Lemma 3.5 (Weak monotonicity under dephasing). Let A: A ^ B be a completely positive map 
and |$)_Ryi = I*) ® 1^) ® maximally entangled state. For a fixed unitary operator U, let 

urb be a state satisfying Tz^ 0 idBicTRs) = o'rb- Then we have 

expZ )2 {Tu <S> A{^RA)\\crRB) > ^expL >2 (idij(8)A(4>iiA)lkHB) • 

Proof. We use the following notation: Fix a unitary operator U, and for 0 < j < d — 1 define 

^ d—1 

pRB '■= ^Zu ® M^Ra) and PRB — -iYl PRW 

j=0 

By Lemma 3.4 we have prb = Tu ® A(<I>Ryi), which implies that 

TZjj ® ids(pRs) = Prb- (3.9) 


10 


We then compute: 


expD 2 {Tu (8) A(^>)||cj/jb) = expD2 (prbIWrb) 

= tr 


d-l 


^J]tr r^^^®A(cI>).a-'/V-V2 


i=o 

d-l 


d 


^tr idij (8)A(<h) • (g) ide pa 

j=0 
= tr [i 


a-i/2pa-V2 


1 


d-l 


-^tr idij(8)A($) 




i=o 


> - tr 
d 


idij(8)A($) • (it 


= -expD 2 (idij(g)A(d>)||cr), 


where the fifth equality follows from (3.9) as well as the assumption that T® idB(cr) = a, and 
the inequality follows from the fact that both the operators id/j (8)A(<1>) and (for all 

j) are positive semidefinite. □ 


3.4 Proof of Theorem 3.2 

Let PA be an input state of the channel M'■ A ^ B. We assume without loss of generality that p 
is full-rank, since otherwise we may restrict T-La to the support of p. In the following we use the 
notation p = dp. Furthermore, we let 

^RA = ® VP)^Ra{Ir ® VJ) 

be a purification of pA- 


Code construction: Let m be a positive integer to be determined. Let P be a rank-m projection 
acting on T-La- Later we will assume that P is chosen randomly according to the Haar measure, but 
for now we assume that P is fixed. Define 


S := 

and let Lf^ be the projection onto the support of S, i.e., 

ns = 5-^/2 

We choose supp(S') as the code space. Since p is full-rank, S has rank m. As a result, the code 
space supp(5) is of dimension m. 

With the above construction, for every |?/)) £ supp(P) we have \/p|V’) G supp(S'). Moreover, 
5'-n2^l^) £ supp(S') where 5 ^ is computed on its support. 

Let us fix some orthonormal basis {|ui) ,..., \vm)} of supp(P), and dehne 
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Then for every i we have \wi) G supp(S'). Furthermore, 

m m 

\w^) = E {^i\ = s-^/^^pP^pS-^/^ = ns. 

i=l i=l 

This implies that {|rt;i) ,..., \ wm)} is an orthonormal basis of the code space. In particular, for 
every |?/)) G supp(P) we have 


II^■'/'V^IV^)II = 1■ (3.10) 

Decoder: For the decoder we choose the Petz recovery map (or transpose channel): 

V:=TfoM*oT-^/^y (3.11) 

Recall that Fx is defined in (3.8) and M* \ B ^ A \s the adjoint of N. It is easy to verify that 
V\ B ^ A\s a CPTP map, and hence a valid decoder. It satisfies the property V[M{S)) = S. 

By the definition of the adjoint map, for every |(/)) G supp(5) we have 

f2(<(,, V o Mm = tr (</. • o M* o p-y/) o 

=tv(Mo Fy^((/>) • o M{(i))^. 

In particular, if we choose Icp) = S~^^'^y/p\'i/j) for some |^) G supp(P), then, it follows from (3.10) 
that \(p) is normalized, and 

Flip operators: The flip operator (cf. Definition B.l and Lemma B.2) of the code space is given 
by 


Fs = E 1"^*) (^il ® I'^i) (^*1 

*j=i 

= (s-^/^^p ® Fp ( ^ ^ 5 - 1 / 2 ^ , 

Average fidelity of the code: We are now ready to compute the average fidelity of the code 
supp(5): 


-^avg(ns) := Favg(AA;supp(5')) 


/ 

J Id 


|<^>Gsupp(S’) 




'|0>esupp(S) 


d/r((^) tr f AA o ■ Fo A^i 


.-1/2 


'|0>esupp(S) 


d/r((^) tr (^Fp ■ M o Ty^{(j)) (g) Px-^y^) ° 
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where fi{-) denotes the Haar measure, and Fb is the flip operator corresponding to Hb- Using the 
deflnition of the adjoint map and Lemma B.2, we continue: 


-Favg(ns) = / d//(</.) tr ( (Fb) 

J\ 4 ')&snpp{S) ^ 

= + 1) ° A/" ® (Fb) ■ (ns ® ns + -Fs)) 

= 1) (Fb) ■ (Fs-./.^ ® Fs-.,.^) (P ® F + Pp)) 

= f dM?*)tr((rf oAP®ACor;y'g)(FB) 

p'|f/>)esupp(P) •'MF’V 

■ (^5-1/2^ ® {tp ® V’)) 

= [ d^{'ip)tr (Fb ■ (^f ovy^ O rg_i/2^) (ip) ® (^)) 

J |V'>6supp(P) A \ / \ ^ ' / / 

= [ dfi{'ip)tT ((^f ovy^ oTs- 1/2 A (ip) • oAror5-i/2^) (ip)) 

= [ d^i{ip)ti ((aPotA ilP) ■ oAToL^-i/a^) {lP) \ , 

where in the last line we used S^/‘^S~^/‘^y/p\'ip) = Ylsy/p\'ip) = \/p|V’) for every \'ip) S supp(P). 

Average fidelity in terms of collision relative entropy: We now express the average fidelity 
in terms of the collision relative entropy. 

fo ((-^° ^Up) ^s-V2^) [ip]) 

= tr ( (a^ o r^) (^) • o AA o r^) (v^)) 

+ tr ( (aa o r^) (^) • (r-yy^ o M o (r^-- r^)) pp)) 

= fo (P^{\/p'P'/p) ■ AA(5)"fo^ • M{^/pip^/p) ■ AA(S')"fo2) 

+ tr ((r-yy o AAo r^) (^) . M 


> expD2 (^M{^/p'ip\/p)\\J\f{S)^ 

~ (V’) • Ad ^^^5-1/2 _ ^ 


Therefore, 


Aavg(n5) > / d/r(V>)expT>2 (Ad(Y^V’\/^)l|Ad(S')) 

J |i/')Ssupp(P) ' 

dKP’) II (r;^(g ° Ad o r^) iP;) • Ad ( 5 - 1/2 


'|V'}6supp(P) 
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Random code space: We now assume that the projection P (of rank m) is chosen randomly 
with respect to the Haar measure. Then the code space supp(S') itself becomes random. Hence, we 
can bound the expectation of the average fidelity of our code by 

[ d/r(P)J-avg(ns) 


Jp 7|'0)Gsupp(P) ^ ' 


- ^ d^l{P) d^(V’) II (r-'(g o aa o r^) (V^) • M ( 5 - 


=: Ti - Ts, 


1 

(3.12) 


where the integration in the third line is again over S supp(P). We refer to the terms appearing 
in the second and third line as Ti and T 2 respectively. To analyse the first term Ti, we use 
Lemma 3.5. Note that the term T 2 vanishes if p is a maximally mixed state (and correspondingly 
is a maximally entangled state). 

Analysis of the first term in (3.12): The first step in analysing Ti in (3.12) is to use Fubini’s 
theorem to change the order of integrals: 

Ti= [ d//(P) [ d/z(V')expZl2 (aA(v^t/>V^)||AA(v^Pv^)') . 

Jp J\ii)&snpp{P) ^ 2 

In the above integral the state \'ip) G supp(P) is distributed according to the Haar measure. Fur¬ 
thermore, for any such state |?/^) there is a projection P' of rank m — 1 such that P = ip + P'. 
Indeed, if we let 

:= {\v) ■ {iPl v) = 0, |u) G T-La], 

be the orthogonal subspace of \ip), then P' is a projection onto some (m — l)-dimensional subspace 
of \ip)'^ ■ Moreover, this projection P' is distributed, independent of \ip), according to the Haar 
measure. Putting these together, we find that 

Ti= [ di^{ip) [ d/r(P')expL >2 (aA(v^V’V^)II-^(VpV’V^)+AA( v^P'v^)) . 

Jip JP':supp(P')C|V)>-^ ^ 2 

The next step is to use the joint convexity of the exponential of the collision relative entropy 
from Lemma A.5. We obtain 


Pi > y d^i(V')expH2 ^AA(v^i/>V^) 

= j d/i(?/))expH2 ^AA(v^'0V^) 

where in the second line we use 


, d^l{P')J\^{^/^P'^/^) 

J P':supp{P')<Z\tpP ^ 


f P':supp(P')C|’0) 


d/r(P')P' = ^^(/-V’). 


d-l 


Then, using p = dp the above bound can be written as 


Ti> dp{1p)expD2{^s^{^/p1p^/p)\\a^J'{y/^1|J^/^) +/3M{p)), 
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where 


(3.13) 


a = 


d — m 
d-l 


and /? = 


d{m — 1) 
d-l ' 


Note that a + fi = m. 

To choose the random (Haar distributed) vector |V’), we may first choose a random (Haar 
distributed) unitary operator U and then take IV') = for a hxed i, where \ui) is given by 

Definition 3.3. We then have 

/* 1 ^ 

Ti > / aJ\f^/^) +/3M(p)^ 

i=i 

f 1 ^ 

= / dp{U)-'y^exp D 2 {uj® M{-\/pu*y/~p) aui®M{^/pu*y/~p)+l3ui®M{p)^ 
i=i 


> 


'U 


(lp(U) exp -D 2 I - ^ ti* 0 N{^/pu*y/p) 


2 = 1 


^ ® a/ 5 ) + (d-KR ® N{p) j , 


2=1 


where the last inequality follows from the joint convexity of expD 2 (-||-). 

We now use Lemma 3.4 and Lemma 3.5 to obtain: 

Ti> f d/r(17)expD2 (^7c/(8) (AAor^)(4>) aTu <S'{Af oT^){^) +/3 ttr'S'A f (p)^ 

J JJ 

> j dp{U)^expD2 (^idRiS'iAf oT^){^) aTu'S'(Af oT^){‘^) +/dnu (S'Af (p)^ ■ 

Note that here we applied Lemma 3.5 with the choice A = M oT^ and 

(Trb = OiTu S {M oT^){^) + (d-KRSMip), 

which satisfies TZjj S idsicrRE) = ctrb- 

Once again using the joint convexity of expD 2 (-||-), we find that 

Ti > ^expD 2 ^idK<8)(AAo r^)(4>) a j^dp{U)Tu S (J\foT^){^) + (^ttrS J\f{p)^ ■ 

Using Lemma 3.4, we compute 

/ d/i(t/) Tc/® idA(<h) = / d/r(U)iV: 

Ju Ju 

= f d^(V’)V' S V’* 

Jp 


Ui 0 $ Ui 


Ip 

= x4> + y-KR S VTA, 


where 


X = 


d+ 1 


and y = 


d+ 1 
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Therefore, 

Ti > ^expT )2 (^idij(8)(AAor^)(^>) (ax) id^ <8)(AA o r^)(^>) + {ay + I5 )ttr® N{p)^ 

= ^expT )2 (^idij(g)(AAor^)(^>) a'idij (8)(AA o r^)(^') +/S'tt/j ( 

= expT)2 (idij(8)AA('I'^)||a' idj? (8)AA('I'^) + P'ttr ®M{p)) , 

where we used = id/j (8)r^(<h) and 

,1 {d — m) 

“ = —“3; = ———— 7 -;——, 

m m{d — l){d + 1) 

1 , {d — m)d d{m — 1) 

P — —(«y + /3) — 77 - -I W , , S, H-77-TT- 

m m(a —l)(a + l) m(a — 1) 

Observe that a' + P' = 1, which follows from (3.13). 

Finally, we use Lemma A.6 to bound the collision relative entropy by the information spectrum 
relative entropy. For any di G (0,1) we obtain 

Ti >^{l-Si) a'+/3'exp ^-Dfi(idH(8)Ar(^^)||7rij (g) Ar(p))^ 


md 

Let us fix El G ((5i, 1). Then Ti > 1 — ei follows from assuming that 

^(1 - 61 ) a' + /3'exp (^-L>fi(id/j(8)Ar(T^)||7ri{ (8)Ar(p))^ 


1 -1 


md 

This inequality is equivalent to 


> 1-ei. 


exp 


-Di^ {idR ®AA(T0 IIvtr ® AA(p))) 


< 


l-,5i 


a 


mdf5'{l — £i) P' 
Using the facts that /3' < 1 and a'//3' < l/(md), this inequality holds if 

1 — 5i 1 £\ — (5i 


exp 


(idi? (8 )AA(T^)IIttjj (g) M{p))^ 


< 


md{l — £ 1 ) md md(l —ei)’ 


which is equivalent to 


log m < (idij (g)AA(T^) lIvTij (g) J\f{p)) — log d + log 


gi — 

1 - ei 


= Zlfi(idii®AA(T0||//jOAr(p)) +log 


£1 - 5i 


1 - ei 

In summary, we obtain Ti > 1 — ei for £i > 0, provided that for some 0 < < ei we have 

£l — (5i 


logm < Zlfi(idij(g)AA('I'^)||/ij (g) Ar(p)) + log 


1 -ei 


(3.14) 


Analysis of the second term in (3.12): The main ideas in this part have already appeared 
in the analysis of the first term of (3.12) above, so we leave the details for Appendix D. There, 
assuming 

logm < (g) p) + log ^—— 

L — £2 

for 0 < (52 < £ 2 , we show that 

r| < 1 + tr(p^) - (1 - £ 2 ) < tr(p^) + £ 2 . 
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The last step: Putting the above bounds on Ti and T2 together, we conclude that 



d/i(P)J’avg(P’) >Ti-T2>l-ei- \/tr(/92) +£3 


holds if 


logm < min I Dg^{idR 'S>Af{p)) + log ^ — p-, Dg'^{d)P\\Iji 0 p) + log 


l-£l 


1-£2 


Then, using Lemma A.l and writing the entanglement fidelity in terms of the average fidelity, we 
conclude that there exists a projection operator P such that, for the code space supp(S'), where 
S = ^/pPy/p, we have 


with £ defined by 


This means that 


Fent(AA; supp(S')) >l-£, 


e = (ei + Vtr(p2) 



> log min 


exp 


(id/j ® AA(p)) + log 


gi - Si 
l-£l 


exp 


Di^idiqiR0p) + iog 


£2 — ^2 
1 -£2 


which gives the desired bound in Theorem 3.2. 


3.5 Asymptotic expansion 

We now prove the main theorem of this section: 

Theorem 3.6 (Second order achievability bound). Let J\f'■ B be a quantum channel and fix 
£ € (0,1). Then we have 

QTW > nlfiM) + VnVfiAf) (e) + O(logn), (3.15) 

where the coherent information Id-l^) and the £-quantum dispersion YfiN) are given by Defini¬ 
tion 2.8. 

Proof. In the following we assume that (idj? (8)AA)('I'^) is not a pure state, since otherwise the channel 
Af is an isometry for which the problem of quantum information (or entanglement) transmission is 
trivial, and the claimed achievability bound is immediate. In other words, letting Ujg'. TLa Pbe 
be a Stinespring isometry of M, we assume that in 

a^RBE = (id_R<8)Z^Ar)(^^) 

the environment E is not completely decoupled from R. This implies that I{R]E)i^ > 0, which is 
equivalent to 


H{Bd - H{RBd < H{Rd = H{p). 


(3.16) 
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By Theorem 3.2 we have the following bound on for an arbitrary input state pn G 

vinT)- 


Qei%-^) > <8)AA(T^")||//{n (g) N{pn)) + log 


£i — 

1-ei ’ 


® Pn) + \og 


£2 — ^2 
1 -£2 


(3.17) 


where £*, <5* > 0 for i = 1, 2 are chosen such that 

e = (^ei + Vtr(7'^) + £ 2 ) , 

with d = dim'H^ and 0 < d* < e*. In the following, we restrict our consideration to input states of 
the form p®” where p G V{'Ha)- 

Fix £ > 0, and for sufficiently large n define ei > 0 by 


£l = £ — 


Vn 


Furthermore, define 


£2 = 




such that (3.7) holds. Then for sufficiently large n, we have 


£2= ^+£ IT- -1 


n 


1 


-1 


-tr((p®")2) 


> 


--tr((pn2) 

\y/n 1 + d^ J VVF ; t 

-tr((p®")2) 


9 

>- 


> 


n ^/n(l + d^) 
4 


n 


Note that here we assume that p is not pure (since otherwise there is nothing to prove), so that 
tr ((/O®"")^) tends to zero exponentially fast in re. Finally, let 


oi = £i - -= = £ - -= and 02 = £2 -> — • 

/re y're re re 


(3.18) 


Then by Proposition A.8 and Lemma 2.1, we obtain the following expansion for the first term in 
(3.17): 


L»fi(idi?"®A/®”((^'’)®’")||7i?" ®A/®"(p®")) + log^^^ 

1 — £1 

= reH(idre<8)A/('I'^)||/re ®M{p)) + Vre'F<l>“^(e) + 0(logre), (3.19) 
where V ='F(idre(8)A/(T^)||/re ® A/(/3)). 
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In Appendix E we show that 

Dl^({^P)^^\\lRn (g) p®”) + log > nH{p) - O(lognv^). (3.20) 

I — £2 

Observe that 

® AA(p)) = H{B)^ - H{RBU 
where oorbe = (id/? Then by (3.16) we obtain 

D{idR®M{^P)\\lR®M{p)) = H{B)^-H{RB)^ < H{R)^ = H{p). 

Therefore, for sufficiently large n, the expression (3.19) is less than (3.20), and hence the lower bound 
in (3.17) is given in terms of the first term. That is, for every e > 0 and arbitrary p G T){'Ha) we 
have 

Qei^i-^) > nD{[dR<S'^f{^'^)\\IR<Sl^f{p)) + y/nV{idRiS'M{^P)\\lR ® Af{p))^~^{£) + 0(logn). 

(3.21) 


Since (3.21) holds for any arbitrary input state p G we have 

max |nIl(idi?(8)AA('I'^)||//j (g) AA(p)) + y/nV{idR^J\f {'I/p)\\Ir 0 Af {p))^~^{£)\ 

P&ViH) 1 J 

+ O(logn). 

> max |nT>(idij(g)Ar('h^)||/ij ^Af{p)) + y^nV{idR^Af{d)P)\\lR ^ Af {p))^~^ {£)\ 
peSciJV) 1 J 

+ O(logn), 

where iSc(AA) is the set of states defined in (2.4). We have 

Zl(idfi®AA(TO||/K® AA(p) = IciAf) 

Sc{Af), and noting that (e) < 0 for e < 1/2 (resp. 4)“^ (e) > 0 for e > 1/2), 

^ (nIc{M) + ^nminpg 5 ^(^) E(AA,p) $-1 (e) + 0(log n) if e G (0,1/2) 
ynlciAf) + ^n maXp^SciJV) ^P) (e) + 0(log n) if e G (1/2,1) 

I/(idi{(g)AA(T^)||/i{(g)AA(p)). The proof is then completed by employing definition 

□ 

4 Quantum information transmission and entanglement genera¬ 
tion capacities 

In this section we use our main result. Theorem 3.6, to prove similar lower bounds on two other 
invariants of quantum channels, namely quantum information transmission capacity and entangle¬ 
ment generation capacity. 

In quantum information transmission, Alice’s task is to convey to Bob, over a single use of 
the channel M, an arbitrarily chosen pure quantum state, from some Hilbert space Rm ^ Ra of 
dimension m, with an error of at most e. More precisely, we define a one-shot e-error quantum code 


for all states p G 
we obtain 

QTm 

where V (Af, p) = 
(2.5) oiVeiM). 
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as a triple ,1^), where Hm ^ 'Ha is a subspace with m = dim'HAf and V : B{Hb) —^ B{Ha) 

is a CPTP map such that 

F{N]Hm)'-= min F'^ {ip, {V o Af){ip)) > I — e. (4.1) 

M&y-M 

Here the embedding T-Lm ^ Ha is Alice’s encoding map and V is Bob’s decoding map. Note that 
by the concavity of the fidelity, (4.1) ensures that the code {m,'HM-,'F>) could be used to convey 
any mixed state p e V{T-Lm) with F^ {p, {V o M o S){p)) > 1 — e. 

The e-error one-shot quantum capacity of the channel M is then defined as: 

Q^’'^{M) := supjlogm : 3{m,HM,'F>) such that F{M;Hm) > 1 — e}- (4:.2) 

For n successive uses of the memoryless channel, we define the e-error n-blocklength quantum 
capacity of the channel M as 


Q^’"(Ar) (4.3) 

We now turn to the information-theoretic task of entanglement generation. In this scenario, 
Alice wishes to generate entanglement shared with Bob (as opposed to preserving it as in the 
entanglement transmission scenario). To this end, Alice locally prepares, without loss of generality, 
a pure bipartite state |Tr^) e T-Lb. ® Ha whose A-part she sends through the quantum channel M 
over a single use of the channel, while retaining the i?-part. The preparation of this state comprises 
Alice’s encoding operation. The goal for Alice and Bob is to generate a maximally entangled state 
^'ba Schmidt rank m. Bob applies a decoding operation P: B —)■ A to the output of the channel 
(which he receives), resulting in the final state (id/j (8)P o AA)(T/j^) shared between Alice and Bob. 
The triple {m,'T ba^'F) is called an e-error code for entanglement generation if 

i"eg(AA; Tba) ■■= (IdB O M){Tba)) >l-e. (4.4) 

The e-error one-shot entanglement generation capacity of the channel Af is then defined as; 

•= supjlogm : 3{m,TBA,F) such that Feg{Af-,TBA) > 1 - e}- (4.5) 

As before, we let 


Qeg%^) ■■= Ql’g , (4.6) 

and call it the e-error n-blocklength entanglement generation capacity of the channel J\f. 

Having defined one-shot capacities for the tasks of entanglement transmission, Q],[^{J\f), entan¬ 
glement generation, Qeg^(A^), and quantum information transmission, Q^’^{M), we now relate the 
three different definitions to each other. 

Lemma 4.1. For every e G (0,1) we have 

Qlf{M) < 

Proof. For the first inequality note that Alice, in an entanglement generation protocol, can fix Tba 
to be equal to 4)^^. Then Qll^{Af) < Qlg^{Af) follows from the definition of the two quantities in 
(3.2) and (4.5), respectively. 
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To prove the second inequality, assume that Alice and Bob successfully implement an e-error 
entanglement generation protocol, for some e € (0,1). Suppose that the final state of the protocol 
is given by (JrA) where the systems R and A are with Alice and Bob respectively. Then, 

i"eg(AA; aRA) >l-e. (4.7) 

Let Rm ^ Rr be the support of the marginal of Since aRA is close to Alice may use this 
entangled state shared between them to teleport [5] any state \ip) G Rm to Bob. This teleportation 
protocol needs forward classical communication, which by Lemma C.l can be granted for free. 

To analyse the error of the whole protocol, let us denote the teleportation part of the protocol 
by the CPTP map T which acts on if 0 aRA- Then, using (4.7) we have for an arbitrary \ip) € Rm 
that 


F‘^{ip,T{ip ® aRA)) = F‘^{T{ip^^RA)^T{(p^aRA)) 

> F‘^{ip (g) If (g) aRA) 

= F\^]i^,aRA) 

> 1-e, 

where in the first line we used the fact that if Alice and Bob share a maximally entangled state 
^raj then they can use the teleportation protocol to perfectly transfer ip from Alice to Bob. In the 
second line we use the monotonicity of the fidelity under CPTP maps. Therefore, the entanglement 
generation protocol followed by the teleportation protocol allows Alice and Bob to transmit any 
state \ip) G Rm with dhnRM = iR with an error of at most e. Hence, the claim follows. □ 

Now as a corollary of the above Lemma 4.1 and Theorem 3.6 we obtain the following: 

Corollary 4.2. For any quantum channel Af and any e G (0,1) we have 

Q^’%M) > Q2^%M) > > nh{M) + [e) + O(logn). 

We note that the above lower bound on Q'^'^{M) and Q'^g{M) can also be proved using the 
one-shot achievability bound of [29, Proposition 20] (see also [42]). This one-shot bound is proved 
based on the decoupling theorem, and we state it here for completeness: 

Proposition 4.3. [29] LetM'- B he a quantum channel with Stinespring isometry Ujy: Ra 
Rbe, let PA G V{Ra), eind for an arbitrary purification 4'^^ of pA define \ujrbe) ■= {Ir ® 
7(y)l4'^yl)- Further, lets G (0,1) andt] G (0,e). Then the s-error one-shot entanglement generation 
capacity satisfies the bound 


QeV(-^) > + 41ogr?. 

Using the second order asymptotic expansion of the smooth max-relative entropy from Proposi¬ 
tion A.8, Proposition 4.3 immediately yields the second order achievability bound of Corollary 4.2 
for Qeg^{Af), and then by Lemma 4.1, for Q'^’^{Af). The advantage of our proof, however, is that 
we first bound Qei^{Af), which by Lemma 4.1 is a smaller quantity. Moreover, in our method the 
decoder is given explicitly as the Petz recovery map (1.4). 
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5 Example: 50-50 erasure channel 

Let M'■ A ^ B he the 50-50 (symmetric) erasure channel, which has zero capacity [4] by the No¬ 
cloning theorem. In this section, we study the e-error n-blocklength capacities of AA, based on ideas 
from [29]. 

Let e G (0,1/2) and m = exp^(A^). Then there is a code with 

Fent(AA®";^M) = (idi^n > 1 - £• (5.1) 

Furthermore, let \u}Rn^n^n) be a purification of (id/jn (g)AA®"')($^„^„). Note that since AA is a 
symmetric channel, can be chosen to be symmetric with respect to the exchange of the 

subsystems and E^. 

It is easy to verify that for any state we have 

, /ijn ® (7 <8* CTy^ri j < -. (5’2) 

On the other hand, by (5.1) we have G B^{ORnj^n) where Or^A’^ ■= i^dRn iSi'D){iORnsn). 

Therefore, 

Fi^AR'^\A"-)e = min maxlog {ORr^^A^Jn^ ^ o-yl") 
eeB^ie) 

< maxlog ® ga^) 

a-A-n. 

< — logm, 

where we used (5.2) in the last line. We continue to bound: 

= log"^ 

_ u'/^ ( pni Tpn\ 

+ (5.3) 

where a is chosen such that sin a = y/e. Here, in the third line we used the data processing 
inequality for the smooth max-entropy. Lemma A.3(iii). In the fourth line we used the duality 
relation for the smooth min- and max-entropies, Lemma A.3(i). In the fifth line we used the 
symmetry of ujr^b'^E”, and in the last line we used Lemma A.3(ii) as well as the fact that e < 1/2. 
The latter implies that a, defined through sina = y/e, satisfies a G (0,7r/4). 

Comparing the third and last lines of (5.3), we find that 

Q:r(AA) < 

cos(2Q;j 

We conclude that we have a constant upper bound on for arbitrary n and e < 1/2. 

On the other hand, consider the case e G (1/2,1). By Theorem 3.6, we have 

> nIc{M) + ^nV,{M) (e) + O(logn) 
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where Ve{M) = V{ijJrb\\Ir ® ^b) for this range of e (cf. Definition 2.8) and ujrb = 

(id/j A straightforward calculation verifies that p) ^ 0 for all p, and that /c(AA) = 

/c(AA, TTyi) = 0. On the other hand, by considering the maximally mixed state as the input state, 
we have 14(AA) > 0 for e > 1/2. It follows that 

Q:d(^^) > ‘fi-' (e) + O(logn), 

and the right-hand side of the above inequality is positive for sufficiently large n since (e) > 0 
for e > 1/2. 

To summarize, for e < 1/2, the e-error n-blocklength capacity Qd^{N) is at most a constant 
independent of n, whereas Qd^{Af) is positive and scales as y/n for e > 1/2. 
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A Properties of distance measures and entropic quantities 


In this appendix we collect useful properties of the distance measures and entropic quantities defined 
in Section 2.2. 

In [21] (see also [32]) the following relationship between the average fidelity and the entanglement 
fidelity was proven: 


Lemma A.l. [21] For any quantum operation A acting on B(T-L) with d = dim.'H, the average 
fidelity and entanglement fidelity are related by 


Fa,vg{^] T-L) 


dFenti^] T~L) + 1 
d + l 


The relative entropy and the quantum information variance satisfy the following duality rela¬ 
tions: 


Lemma A.2. Let \'fiABc) be a pure state with corresponding marginals pab,PACi o,nd pbc, then 

D{pab\\Ia® Pb) =—D{pac\\Ia® Pc) o-nd y{pAB\\lA®PB) = y{PAc\\lA®Pc)- 

These relations have been used in [ 17 ]. Here, we give a proof for the sake of completeness. 

Proof. The relation D{pab\\Ia®Pb) = —D{pac\\Ia®Pc) follows from a straightforward calculation. 
For the second equation, we only need to establish that 

tr [pABiygPAB - Ia® log pb)"^] = tr [pAc{logPAC - I A® logpc)^] , 

or equivalently, that 

tr [pab log^ PAb] + tr [pB log^ Pb] - 2 tr [pAB{log pab){Ia ® log Pb)] 

= tr [PAC log^ PAc] + tr [pc log^ Pc] - 2 tr [pAc{log Pac){Ia ® log Pc)] ■ 

Considering the Schmidt decompositions of \ fiABc) along the cuts ABjC and AC/B, one verifies 
that tr [pab log^ Pab] = tr [pc log^ pc] and tr [pB log^ Pb] = tr [pAc fog^ PAc] ■ It remains to be 
shown that 

tr [pAB{logpAB){lA ® log pb)] = tr [p ac{ log PAc)[IA ® log pc)] , 
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which is equivalent to 


{' 4 ’abc\ (log pab ® Ic){Iac ® log pb) \iPabc) = {'^abc\ {Iab ® log/?c)(log pac ® Ib) \i’ABc) ■ 

Once again using Schmidt decomposition, we find that log pab ® Ic \'4 ’abc) = Iab ® log pc \'4 ^abc) 
and that Iac ® logps \'4 ^abc) = logpuc ® Ib \'4^abc)i which concludes the proof. □ 

The next lemmas concern the smooth min- and max-entropies. 

Lemma A.3. The following properties hold: 

(i) [23] Let pabc a pure state and e G (0,1), then 

(ii) [41] Let pab S LUJ-Lab) a,/3 > 0 he such that a + /3 < 7r/2. Then we have 

< HZf{A\B), + log—1—. 

cos"'(a-I-p) 

In particular, if e,e' > 0 such that e + e' < 1, then 

HLniA\B)p < + log 

(iii) [44] Data-processing inequality for max-entropy: Let pab S II{Hab), ^ G (0,1), and A: B ^ 
D be a CPTP map with tad '■= (idA < 8 ) A )( p ^ s)7 then 

Hl,,,,{A\B)p < Ht^MD)r- 

Lemma A.4. [43, 41] Let p G T){TLab) and e G (0,1), then 

HP^{A^\B^)p^^ > nH{A\B)p - (yW) 
where g{t) = — log(l — y/l — t"^). 

We use the following convexity property of the collision relative entropy. 

Lemma A.5. [30, 48] The function {p,cr) i—>■ eyvpD2{p\\(7) is jointly convex. 

The following inequality between the collision relative entropy and the information spectrum 
relative entropy is used in our proof of the main result. 

Lemma A.6. [3] For every 0 < e, A < 1, p G 'D{TL) and a G V{'H) we have 

expH 2 (p||Ap-b (1 - A)cr) > (1 - e) [A-h (1 - A) exp (-Df (p||a))]"^. 

The information spectrum entropy and the smooth max-relative entropy (cf. Definition 2.4) can 
be bounded by each other: 

Lemma A.7. [45] Let p,(7 G ViPi), e G (0,1), and p > 0. Furthermore, let v{o') denote the number 
of different eigenvalues of a. Then we have 

D^%Ph) < Df+'?(p||(T) -logr/ + logz/(a) -log(l -e). 
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Remark. Lemma A.7 can be obtained by combining two resnlts in [45] which bonnd the infor¬ 
mation spectrum entropy and the smooth max-relative entropy in terms of the hypothesis testing 
relative entropy [47]. These bounds are proved in Lemma 12 and Proposition 13 of [45], respectively. 

The second order asymptotic expansion of the smooth max-relative entropy and the information 
spectrum relative entropy are derived in [45]. While the former is one of the main results therein, 
the latter is only proved implicitly. 

Proposition A.8 . [45] Let e £ (0,1) and p,(T ^ 'DiLL), then we have the following second order 
asymptotic expansions for the smooth max-entropy and the information spectrum relative entropy: 

= riD{p\\a) - y'nV{p\\a) (e^) -h O(logn) 

Zlf(p'^"'||cr®"') = nD{p\\a) -\- y/nV{p\\a) (e) -\- O(logn) 


B The flip operator 

In the proof of Theorem 3.2 in Section 3.4 we make use of the following operator and its properties: 

Definition B.l (Flip operator on a subspace). Let /C C "H be a subspace of the Hilbert space LL. 
We define the flip operator Fx, as the linear extension of the operator defined by the action 

Fa:([^i) (8) \f2)) = IV' 2 ) <8) IV’i) 

for every [V’l)) IV' 2 ) S hC. For a Hermitian operator X £ we define Fx = -Fs^pp^x)- 

The proofs of the following properties of the flip operator can be found e.g. in [13]. 

Lemma B.2. [13] Let JC f LL be a subspace of the Hilbert space H with dim/C = dx, then the 
following properties hold: 

(i) Given an orthonormal basis of 1C, the flip operator can be expressed as 

d-K 

Fx= \vi){vj\ (g) \vj){vi\. 

*j=i 

(ii) For operators X, Y acting on X, we have tr(Ay) = ii{Fx{X 0Y)). 

(iii) The flip operator is idempotent on its support, i.e., Fl = lix <S> Hx where Hx is the orthogonal 
projection onto X. 


(iv) We have: 


(v) More generally. 




(n^ (g) -k Fx) 


j dp{P)P (g) P = yiH^ <g) -k 72 F,»c 


where the integral is taken over rank-m orthogonal projections with respect to the Haar measure 
and 


71 = 


m{mdx — 1) 
dx{d?x - 1 ) 


72 = 


m{dx — m) 

dfcidx - 1 )' 
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C Forward classical communication does not increase quantum 
capacity 

In this appendix we establish that forward classical communication between Alice and Bob does 
not increase the one-shot quantum capacities defined in Sections 3.1 and 4. 

Lemma C.l. [2] For any quantum channel M : B, allowing forward classical communication 

between Alice and Bob does not increase any of the quantum capacities Qlg{J\f), and 

defined in (3.2), (4.5), and (4.2), respectively. 

Proof. Let us prove this statement for the one-shot entanglement generation capacity Qlg{Af); the 
claim then follows analogously for and 

We consider a modified entanglement generation protocol in which Alice, in addition to sending 
a quantum state to Bob through the channel Jf, is allowed to send classical information. Without 
loss of generality, we model this forward classical communication between Alice and Bob as follows: 
Alice prepares a pure state |T^^), with some probability px, where x G A is some classical label 
from a finite set A that she sends to Bob using a noiseless classical channel. Upon receiving the B- 
part of the state (id^ (8>AA)(T^^) as well as the label x, Bob applies a (CPTP) decoding operation 
V^, resulting in the average state 

E ,^Px(idij®P"oAr)(T))^). (C.l) 

The fidelity of this modified entanglement generation protocol is given by 

< maxF^ ($- 5 „ (id^®P" o Ar)(T)^^)), 
x^X 

where in the second equality we used the fact that the squared fidelity of a pure state and an 
arbitrary mixed state is linear in the latter. We conclude that if the code (m, m.cix) 

is an e-error code for entanglement generation assisted with forward classical communication, then 
there exists some xq G A such that (m, is an entanglement generation code (without 

classical communication) with error e. The proof then follows from the definition of Q\g{J\f). □ 


D Proof of Theorem 3.2: Analysis of the second term in (3.12) 

Recall that 

T2= [ d^i{P) j d^(V’) 

Jp J\'ip)Gsupp{P) 

Let us denote the 1-norm expression under the integral by T^iP,^!), and observe that 

r'(p,^) < II (r-'(g o aTo r^) • \\m (5-^2 
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Since IV') G supp(P), we have 'ip < P and 


<^~Ps)°^i.PpPpp) 

= p~J(Pm) 

< Ib- 


Therefore, 


T^iP,P^)< 

J\f - v^V’ 

VP) 

< 

s~^P^/p'^P^/ps~^P - v^V’v^ 

i’ 


where in the last line we used the fact that AA is a CPTP map. 

Let \(j)) = and \4>') = ^\'ip). Then, as verified in (3.10), the vector \(p) is normalized. 

Thus, a straightforward calculation yields 




1 + {'tplpl'ipf - (iplpl'ip) - (-01 


Then, using the concavity of the square root function we have 


'|i/’>6supp(P) 




< 


dp{ip) (i + OjIp lipf - (V'l p lip) - {ipl IV’)' 


1 1/2 


'|V’>esupp(P) 

We compute each term under the integral individually. The easiest one is the third one: 


/ 

Jl'ih 


dp{ip) (V’l pIV’) = 


f \ d 

' dp{'ip)tr{p'ip) = —tr(/5P) = —tr(pP) 

IV’)Ssupp(P) m m 


'li/lesuppiP) 

For the second term, we compute using Lemma B.2: 

[ d/r(V’) (V’IpIV’)^ = / dp{'^p)tI{p^pp'^p) 

■/|V’}Ssupp(P) J\ip)&upp(P) 


'|V’}6supp(P) 


d^(V’) tr(Pp(pV’ ® P'lp)) 


'\tp)&upp{P) 

1 


d/i(V’)tr(Pp(p(8) /5)(V’ <8) Ip)) 


m{m + 1) 
1 

m{m + 1) 


tr(Pp [p® p){P ® P + Fp)) 
(tr ((pP)^) + (tr(pP))2) 
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We express the last term as 

(V’l = (V'l {ii)\ 

= expL>2 {ypi}\/p\\S^ ■ 

Putting these together, we find that 


'\-ip)&upp{P) 


d/^W^2(-P>'0) < 


1 + . ^ . (tr {{pP)^) + (tr(pP))2) 

m[m + Ij 

d C 

-tr(pP) - / d/r(?/;)expi :»2 

^ ^|'i/>)Ssupp(P) ^ 


1 1/2 


Again using the concavity of the square root function, we obtain 

2 


rp2 _ 

-^2 — 


< 1 


/ d/i(P) / d/r(V’)T'(P,V’) 

'P |'0)Gsupp(P) 


+ —^ [ d/r(P)tr ((pP)^) + . ^ . [ dp(P)(tr(pP))2 


m 7 p 


d//(P)tr(pP) - f dp(P) f d/i(V’)expP 2 (v^V’y/^ll'S') 

Jp J \^p)Gs\lpp{P) ^ ^ 


Observe that 


/ dp(P)P = —/ and [ dp{P)P ^ P = Ia + i^Pa 

Jp d Jp 


where 


7 = 


m{md — 1) 


and K = 


d(d2 - 1) 

We note that '-yd? + Kd = tr(P ® P) = m?. Therefore, 


m(d — m) 
d{d? - 1) ■ 


— / dp(P)tr(pP) = 1, 

m Jp 


and 


We similarly have 


dp(P) tr {{pPf) = dp(P) tr(PA(p ® p){P ® P)) 

= -fiT{FA{p® p)) + Ktx{FA{p® p)Fa) 

= 7tr(p^) + Ktr(p)^ 

= 7(i^ tr(p^) + kJF. 

j dp(P)(tr(pP))^ = j dp{P) tr((p (g) p)(P (g) P)) 

= 7tr(p(g) p) + «;tr((p (g) p)Fa) 

= '-ydJ + fcd^ tr (p^) . 
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Putting these together, we arrive at 


r| < 


m{m 
< 1 + 


+ + dp,{P) [ dp{'tlj)expD2(y/^'ipy/^\\s) 

i + J-J Jp J|i/>>esupp(P) ^ ^ 

- / dp{P) [ d/r('0)expL»2 (V^V’v/^ll'S’) , 

Jp J|V'>6supp(P) ^ ^ 


where in the second line we used '’yd? < m? and < m. To get an upper bound on this expression, 
we need a lower bound on the last term with the double integral. For this we repeat the same process 
as before: we first change the order of integral, then write the result in terms of dephasing maps, 
and finally use Lemma 3.5. We first have 

[ [ dp(V’)expD2 (v^V'V^lls) 

Jp J\p)esupp{P) ^ ' 

= / d/r(P) / d^(i/>)expi :)2 ( 


= [ d/r(V’) [ d/i(P')expD2 ( 

Jp J P':swpp{P')C.\p)^ 


> 


Ip 


dV’exp 1)2 {ypip^/pWa^tp^ + , 


where in the last line we used the joint convexity of expD2(‘||')' Next, writing the result in terms 
of dephasing maps and using Lemma 3.5 as before, we arrive at 


[ dp{P) [ d//('i/’) exp i)2 ——- exp D 2 (dl^Wa'dl'^ +/J'ttr ^ p) . 

Jp J\P)esupp(P) ^ ™ 


'|b)6supp(P) 

Finally, using Lemma A.6 we find that for any <52 £ (0,1) we have 


/ dp{P) / dpi'ijj) exp D 2 

Ip -/|i/;)Esupp(P) 




a' + 13' exp TTp <8) p)J 


1 -1 


We now assume that 


^(1-52) a'+/3'exp (^-Dl'^{d>P\\TTR^ p)^ > 1-62 

for some £2 > 0. Repeating the same calculations as before, this inequality holds if 

£2 — ^2 


logm < Dg^{dlP\\Iji (g) p) + log 


I -£2 


Then assuming the above inequality, we obtain 


r| < 1 + tr(p^) - (1 - £ 2 ) < tr(p 2 ) + 62 . 
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E Proof of Equation (3.20) 

Observe that 


£2 — ^2 ^ 1 

log-- > -logn, 

1-62 


and that 


dS2 ® > Ds- ® p®”) , 

since 82 > 3/n by (3.18). Then it suffices to show that 

d} > nD{^P\\lR® p) - 0{\ognV^). (E.l) 

Using Lemma A.7 we find that 

Df ((.I/P)®-||/^„ ® p®-) > {{^PniRu ® p®-) - logu{lRn ® p®-) + log(l/n - 2/n2) 

where i^(/_Rn 0 /o®”) is the number of different eigenvalues of Iru (g) /o®”. We note that i^(/_Rn <8) p®"’) 
grows polynomially in n. 

Let .S = B^ i_ 2 in We continue to bound: 


D 


(('h^)®"||/R. ® p®") = min ® P®" 


> min min L>max(wn||-fR" <8) fT„) 

<^71 (jJ-n^lS 

= min minD^a_^{uJn\\IRr^ ® (t„) 


= — max 


- mini:)max(Wn||-fR" ® fT„ 

<^n 




_ ^fT=Wr 

— Xl 1 


{A^)p 0 n, 


where in the last line we used the duality of the min- and max-entropies, Lemma A.3(i). 
Next, using Lemma A.3(ii), we have 


^ (A"')p®u -h log I 1 - [ Y ^ ~ 


Observe that the following holds for sufficiently large n: 

2 


Therefore, 


1 - 


. 2 1 

1-1—2 

n 



( 

2 

1 1 

= 1 - 

1- 

-h 



V 

n 

rr 

2 

1 

1 

1 2 





n 



/ n 

1 




> -. 




n 





1 - - 
n 


H, 


£J^{A^)p^. > H]i^{A^)p^r. - O(logn). 
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Finally, using Lemma A.4 we have 


> ^H{A)p - O (\/^(W)) 

where g{t) = — log(l — Vl — Observe that Vl — < 1 — t^/2. Thus, — log(l — Vl — < 

— logt^/2, and hence 

■^min > nH{A)p - 0(logn)^/n. 

Putting all the above inequalities together yields (3.20). 
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